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IN THE CLAIMS 

Please amend claims 26, 29, 30, and 32 as follows: 
1-16. (Cancelled) 

17. (Previously Presented) A method for clustering document including generating 
clusters with user respective comprising: 

receiving session logs; 

performing log-based clustering on the session logs to generate session clusters; 
representing each session cluster as a log-based document suitable for content based 
clustering; 

receiving a plurality of documents that includes a first document that was accessed 
in one session and a second document that was not accessed in the sessions; 

replacing (he first document with a log-based document associated with the 
session cluster thai includes the first document; and 

performing content based clustering on at least the first document and the second 
document to generate clusters with user perspective. 

18. (Previously Presented) The method of claim 1 7 wherein representing each session 
cluster as a log-based document suitable for content based clustering includes modifying each 
document referenced in the session cluster so that a Euclidean distance between the 
documents is the same. 

19. (Previously Presented) The method of claim 17, wherein each session log comprises a 
query used to retrieve documents. 

20. (Previously Presented) The method of claim 17, wherein each session log comprises a 
number of documents found to satisfy a query. 
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2 1 . (Previously Presented) The method of claim 1 7, wherein each session log comprises a 
list of documents opened by a user. 

22. (Previously Presented) The method of claim 17, wherein each session log comprises a 
length of time that a document was opened. 

23. (Previously Presented) A method for clustering documents comprising: 
generating i hybrid matrix of vectors comprising a first vector representing a first 

document and a second vector representing a log-based document cluster, and 
clustering the documents using the hybrid matrix. 

24. (Previousl) Presented) The method of claim 23 wherein a second vector is used in 
place of a second document within the hybrid matrix wherein the second document forms a 
portion of the log-based document cluster. 

25. (Previously Presented) The method of claim 23 wherein clustering the documents 
using the hybrid matrix is performed using a content-based clustering technique. 

26. (Currently Amended) The method recited in claim 23 wherein generating the hybrid 
matrix comprises. 

accessing ietrieval session logs; 

clustering retrieval sessions into session clusters; 
j generating, a log-based document cluster for each session cluster by combining all 

documents opened during any retrieval session of the session cluster; 

generating a log-based document cluster vector for each of the log-based document 
clusters; 

replacing €*ach document in the log-based document cluster with the log-based 
document cluster /ector; 

generating an individual document vector for each document not opened during any 

retrieval session; ;ind 
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combining the log-based document cluster vector and the individual document cluster 

vector. 

27. (Previously Presented) The method of claim 26 wherein the step of clustering retrieval 
sessions into session clusters comprises the steps of: 

generating d Boolean session vector for each retrieval session; 

forming a matrix of the Boolean session vectors; and 

applying a clustering algorithm to the matrix of the Boolean session vectors. 

28. (Previously Presented) A system for clustering documents, the system comprising; 
a storage for storing retrieval session logs; and 

a processor connected to the storage, configured to cluster the retrieval sessions into 
session clusters, generate, for each session cluster, a log-based document cluster, generate a 
log-based document cluster vector for each of the log-based document clusters, generate an 
individual document vector for each document not opened during any retrieval session, 
cluster the documents using the log-based document cluster vectors and individual document 
vectors. 

| 29. (Currently Amended) The system of claim ^2Lwherein the documents arc stored in 
the storage. 

I 30. (Currently Amended) The system of claim *K>8further comprising: 
1 a memory connected to the processor, for storage of a hybrid matrix comprising the 

log-based document cluster vectors and the individual document vectors. 

31 . (Previously Presented) A data processing system having session logs and documents, 
the system comprising: 

a processor for executing program instructions; and 

a media readable by the processor having a document clustering module having a 
plurality of instructions, that when executed by the processor, performs log-based clustering 
on the session logs to generate session clusters, converts the session clusters into a form 
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suitable for conteni -based clusters, performs content-based clustering on the documents and 
session clusters in ;i form suitable for content-based clustering to generate document clusters 
with users' perspective. 

| 32. (Currently .\mended) The system of claim 4-3-3 Lwherein the document clustering 
module further comprises: 

a session v(5Ctor generation module for receiving the session logs and based thereon 
for generating a session vector for each session log; 

a session cluster generation module coupled to the session vector generation module 
for receiving the session vectors and based thereon for generating session clusters; 

a hybrid matrix builder for receiving the documents, coupled to the session cluster 
generation module, for receiving the session clusters and based thereon for generating a 
hybrid matrix having at least one log-based document; and 

a topic generation module coupled to the hybrid matrix builder for receiving the 
hybrid matrix and based thereon for generating document clusters with users' perspective. 

33. (Previously Presented) The system of claim 32 wherein the hybrid matrix builder 
further comprises: 

a session document generation module for receiving session clusters and based 
thereon generates super documents; and 

document modification module coupled to the session document generation module 
for receiving the s jper documents, for receiving the documents, and based thereon for 
generating the hybrid matrix. 

34. (Previously Presented) The system of claim 3 1 wherein the media is one of a floppy 
disk, compact disc, a volatile memory, and a non-volatile memory. 

35. (Previously Presented) A machine readable memory device encoded with a data 
structure for clustering documents, the data structure having entries for a log-based document 
cluster vector generated from a log-based document cluster, and an individual document 
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vector corresponding to a vector generated from a first document, the first document not 
belonging to any log based document cluster. 



6 



PAOT 8/16 t RCVD AT 9/18^006 4:39:45 PM [Eastern Dj^Oght Time] * SVR:U^TO€FXRF-3/7 * DNiS:27»300 * CSID:612 573 2005 * DURATION (mm-ss):04-56 



